Validation Sequence Optimization: A Theoretical Approach

نویسندگان

  • Gediminas Adomavicius
  • Alexander Tuzhilin
چکیده

T need to validate large amounts of data with the help of the domain expert arises naturally in many dataintensive applications, including data mining, data stream, and database-related applications. This paper presents a general validation approach that generalizes different expert-driven validation methods developed for specialized validation problems. In particular, we model the validation process as a sequence of validation operators, explore various properties of such sequences, and present theoretical results that provide for better understanding of the validation process. We also address the problem of selecting the best validation sequence among the class of equivalent sequence permutations. We demonstrate that this optimization problem is NP-hard and present two heuristic algorithms for improving validation sequences.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online Supplement to “Validation Sequence Optimization: A Theoretical Approach”

Optimization: A Theoretical Approach” Gediminas Adomavicius Department of Information and Decision Sciences, Carlson School of Management, University of Minnesota, 321 19th Avenue South, Minneapolis, Minnesota 55455, USA, [email protected] Alexander Tuzhilin Department of Information, Operations, and Management Sciences, Stern School of Business, New York University, 44 West 4th Street, New York, N...

متن کامل

Establishing an Argument-Based Validity Approach for a Low-Stake Test of Collocational Behavior

Most of the validation studies conducted across varying test application contexts are usually framed within the traditional conceptualization of validity and therefore lack a comprehensive framework to focus on test score interpretations and test score use. This study aimed at developing and validating a collocational behavior test (CBT), drawing on Kane's argument-based approach to validity. F...

متن کامل

Global Optimization of Stacking Sequence in a Laminated Cylindrical Shell Using Differential Quadrature Method

Based on 3-D elasticity approach, differential quadrature method (DQM) in axial direction is adopted along with Globalized Nelder–Mead (GNM) algorithm to optimize the stacking sequence of a laminated cylindrical shell. The anisotropic cylindrical shell has finite length with simply supported boundary conditions. The elasticity approach, combining the state space method and DQM is used to obtain...

متن کامل

Some Results about the Contractions and the Pendant Pairs of a Submodular System

Submodularity is an important  property of set functions with deep theoretical results  and various  applications. Submodular systems appear in many applicable area, for example machine learning, economics, computer vision, social science, game theory and combinatorial optimization.  Nowadays submodular functions optimization has been attracted by many researchers.  Pendant pairs of a symmetric...

متن کامل

Early Stopping as Nonparametric Variational Inference

We show that unconverged stochastic gradient descent can be interpreted as a procedure that samples from a nonparametric approximate posterior distribution. This distribution is implicitly defined by the transformation of an initial distribution by a sequence of optimization steps. By tracking the change in entropy over these distributions during optimization, we form a scalable, unbiased estim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • INFORMS Journal on Computing

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2007